klotz: large language models


  1. A collection of lightweight AI-powered tools built with LLaMA.cpp and small language models.
  2. This paper explores the structure of the feature point cloud discovered by sparse autoencoders in large language models. It investigates three scales: atomic, brain, and galaxy. The atomic scale involves crystal structures with parallelograms or trapezoids, improved by projecting out distractor dimensions. The brain scale focuses on modular structures, similar to neural lobes. The galaxy scale examines the overall shape and clustering of the point cloud.
    2024-11-06 by klotz
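The "projecting out distractor dimensions" step in item 2 can be sketched with plain linear algebra: subtract each feature vector's component inside the span of the distractor directions, which sharpens parallelogram/trapezoid structure in the remaining coordinates. This is a minimal illustration, not the paper's code; the toy point cloud and the single distractor direction are assumptions.

```python
import numpy as np

def project_out(points, distractors):
    """Remove the span of the distractor directions from each feature vector."""
    # Orthonormalize the distractor directions (columns of Q span the subspace)
    Q, _ = np.linalg.qr(np.asarray(distractors, dtype=float).T)
    # Subtract each point's component lying inside the distractor subspace
    return points - (points @ Q) @ Q.T

# Toy example: a planar parallelogram perturbed along one distractor axis
rng = np.random.default_rng(0)
base = np.array([[0, 0, 0], [1, 0, 0], [0, 1, 0], [1, 1, 0]], dtype=float)
distractor = np.array([[0.0, 0.0, 1.0]])          # assumed distractor direction
tilted = base + rng.normal(size=(4, 1)) * distractor
cleaned = project_out(tilted, distractor)          # z-component removed
```

After projection the points lie back in the plane, so the parallelogram geometry is recoverable from the first two coordinates.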
  3. The article discusses the emerging role of AI agents as distinct users, requiring designers to adapt their practices to account for the needs and capabilities of these intelligent systems.

    - Agents are becoming active users in systems, requiring designers to extend UX principles to cover both humans and AI agents.
    - The future of UX lies in understanding and designing for Agent-Computer Interaction.
    2024-11-06 by klotz
  4. Replace traditional NLP approaches with prompt engineering and Large Language Models (LLMs) for Jira ticket text classification. A code sample walkthrough.
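The prompt-engineering approach in item 4 replaces a trained classifier with a zero-shot prompt sent to an LLM. A minimal sketch of the prompt-construction step, assuming a hypothetical label set (the article's actual labels and model calls are not shown here):

```python
LABELS = ["bug", "feature-request", "question"]  # hypothetical label set

def build_prompt(ticket_text, labels=LABELS):
    """Build a zero-shot classification prompt for a Jira ticket."""
    label_list = ", ".join(labels)
    return (
        "Classify the following Jira ticket into exactly one of these "
        f"categories: {label_list}.\n"
        "Answer with the category name only.\n\n"
        f"Ticket: {ticket_text}"
    )

prompt = build_prompt("Login page throws a 500 error after the last deploy.")
```

The returned string would then be sent to whatever LLM endpoint is in use; constraining the answer to the category name keeps the response trivially parseable.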
  5. A comparison of frameworks, models, and costs for deploying Llama models locally and privately.

    - Four tools were analyzed: HuggingFace, vLLM, Ollama, and llama.cpp.
    - HuggingFace has a wide range of models but struggles with quantized models.
    - vLLM is experimental and lacks full support for quantized models.
    - Ollama is user-friendly but has some customization limitations.
    - llama.cpp is preferred for its performance and customization options.
    - The analysis focused on llama.cpp and Ollama, comparing speed and power consumption across different quantizations.
    2024-11-03 by klotz
  6. All Hands AI has released OpenHands CodeAct 2.1, an open-source software development agent that can solve over 50% of real GitHub issues in SWE-Bench. The agent uses Anthropic’s Claude-3.5 model, function calling, and improved directory traversal to achieve this milestone.
    2024-11-02 by klotz
  7. Visa is leveraging artificial intelligence across numerous aspects of its operations, with no plans to slow down its implementation.
    2024-11-02 by klotz
  8. Docling is a tool that parses documents and exports them to desired formats like Markdown and JSON. It supports various document formats including PDF, DOCX, PPTX, Images, HTML, AsciiDoc, and Markdown.
    2024-11-01 by klotz
  9. The post discusses the feasibility of fine-tuning an encoder-decoder model to translate Egyptian Middle Kingdom hieroglyphics into English. The author suggests that with sufficient training data and a tokenizer that includes Egyptian characters, the model could learn to interpret hieroglyphics fluently. Commenters mention plugins and knowledge already present in existing models as alternatives to fine-tuning.
  10. This article summarizes various techniques and goals of language model finetuning, including knowledge injection and alignment, and discusses the effectiveness of different approaches such as instruction tuning and supervised fine-tuning.



About - Propulsed by SemanticScuttle